Comparing the Memory System Performance of DSS Workloads on the HP V-Class and SGI Origin 2000
Abstract
In this paper, we present an in-depth analysis of the memory system performance of DSS commercial workloads on two state-of-the-art multiprocessors: the SGI Origin 2000 and the HP V-Class. Our results show that a single query process takes almost the same number of cycles on both machines. However, when multiple query processes run simultaneously, execution time tends to increase more on the SGI Origin 2000 than on the HP V-Class because communication overhead is more expensive on the Origin 2000. We also show the rate at which the number of data cache misses, the number of context switches, and the overall execution time increase as more query processes run simultaneously.
Similar resources
Comparing OpenMP, HPF, AND MPI Programming: A Study Case
This paper presents a comparison of three programming models—OpenMP, HPF, and MPI—applied to a diphasic compressible fluid mechanics code. The parallelization analysis is conducted, and the authors also present the experimental results obtained on various platforms: a Compaq Proliant 6000 (4 processors), a Cray T3E-750 (300 processors), an HP Class V (16 processors), a SGI Origin 2000 (32 proce...
Lessons Learned when Comparing Shared Memory and Message Passing Codes on Three Modern Parallel Architectures
A serial Fortran 77 micromagnetics code, which simulates the behaviour of thin-film media, was parallelised using both shared memory and message passing paradigms, and run on an SGI Challenge, a Cray T3D and an SGI Origin 2000. We report the observed performance of the code, noting some important effects due to cache behaviour. We also demonstrate how certain commonly-used presentation methods can...
Parallel Sequence Mining on Shared-Memory Machines
We present pSPADE, a parallel algorithm for fast discovery of frequent sequences in large databases. pSPADE decomposes the original search space into smaller suffix-based classes. Each class can be solved in main-memory using efficient search techniques, and simple join operations. Further each class can be solved independently on each processor requiring no synchronization. However, dynamic in...
Performance analysis of AlphaServer GS1280
This paper evaluates performance characteristics of the HP AlphaServer GS1280 shared-memory multiprocessor system. The GS1280 system contains up to 64 Alpha 21364 (EV7) CPUs connected together via a torus-based interconnect. We describe architectural features of the GS1280 system. We compare and contrast the GS1280 to the previous-generation Alpha systems (AlphaServer GS320 and ES45/SC45), as w...
A Memory-Centric Characterization of ASCI Applications Via a Combined Approach of Statistical and Empirical Analysis
Memory latency is a substantial contributor to single processor performance loss. Latency hiding techniques such as out of order and speculative execution, outstanding loads to memory, and increases in cache size, have been implemented by chip designers to alleviate this bottleneck. Incorporation of such techniques in superscalar processors has made application performance evaluation a more di ...
Publication date: 2002